Classification and Regression Tree Method for Forecasting
Authors
Abstract
Sentiment classification is a special case of text classification whose objective is to classify a text according to the sentiment polarity of the opinions it contains, e.g., favorable or unfavorable, positive or negative. Twitter is an online social networking service that enables users to send and read short, 140-character messages called "tweets", and it attracts many people to post their feelings and opinions on various topics. Such sentiment-bearing posts not only give an emotional snapshot of the online world but also have potential commercial, financial, and sociological value. Sentiment classification is especially difficult for tweets. Because the topics on Twitter are very diverse, it is impossible to train a universal classifier for all topics. Moreover, compared with product reviews, Twitter lacks data labeling and a rating mechanism from which sentiment labels can be acquired. The extreme sparsity of tweet text further degrades the performance of a sentiment classifier. A Twitter user may also hold different opinions on different topics, and such opinions can be classified using the classification and regression tree (CART) method. CART analysis is a tree-building technique that differs from traditional data analysis methods. Factors that limit CART's general acceptance are the complexity of the analysis and the fact that, until recently, the software required to perform it was difficult to use.
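The tree-building idea behind CART can be illustrated with a minimal sketch. It assumes the Gini impurity criterion commonly used for CART classification trees and a hypothetical two-feature representation of each tweet (counts of positive and negative words). The toy data, the names TWEET_FEATURES and LABELS, and the depth limit are illustrative assumptions, not taken from the paper.

```python
from collections import Counter

def gini(labels):
    """Gini impurity of a list of class labels (0.0 means pure)."""
    n = len(labels)
    if n == 0:
        return 0.0
    return 1.0 - sum((c / n) ** 2 for c in Counter(labels).values())

def best_split(rows, labels):
    """Return (feature, threshold, weighted_gini) of the best binary split.

    feature is None when no split lowers the impurity of the node.
    """
    best = (None, None, gini(labels))
    n = len(rows)
    for j in range(len(rows[0])):
        for t in sorted({r[j] for r in rows}):
            left = [y for r, y in zip(rows, labels) if r[j] <= t]
            right = [y for r, y in zip(rows, labels) if r[j] > t]
            if not left or not right:
                continue
            score = (len(left) * gini(left) + len(right) * gini(right)) / n
            if score < best[2]:
                best = (j, t, score)
    return best

def build_tree(rows, labels, depth=0, max_depth=3):
    """Recursively grow a tree; leaves hold the majority class label."""
    j, t, _ = best_split(rows, labels)
    if j is None or depth >= max_depth:
        return Counter(labels).most_common(1)[0][0]
    left = [(r, y) for r, y in zip(rows, labels) if r[j] <= t]
    right = [(r, y) for r, y in zip(rows, labels) if r[j] > t]
    return {
        "feature": j,
        "threshold": t,
        "left": build_tree([r for r, _ in left], [y for _, y in left],
                           depth + 1, max_depth),
        "right": build_tree([r for r, _ in right], [y for _, y in right],
                            depth + 1, max_depth),
    }

def predict(tree, row):
    """Walk from the root to a leaf and return its class label."""
    while isinstance(tree, dict):
        branch = "left" if row[tree["feature"]] <= tree["threshold"] else "right"
        tree = tree[branch]
    return tree

# Hypothetical toy data: [positive-word count, negative-word count] per tweet.
TWEET_FEATURES = [[2, 0], [1, 0], [0, 2], [0, 1], [1, 1], [3, 1]]
LABELS = ["pos", "pos", "neg", "neg", "neg", "pos"]

TREE = build_tree(TWEET_FEATURES, LABELS)
```

Calling predict(TREE, [2, 0]) walks the fitted tree for a new feature vector. A practical CART implementation would also prune the tree and would use a much richer text representation than two word counts.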
Similar resources
Forest Stand Types Classification Using Tree-Based Algorithms and SPOT-HRG Data
Forest type mapping is one of the most necessary elements in forest management and silvicultural treatments. Traditional methods such as field surveys are time-consuming and cost-intensive. Improvements in remote sensing data sources and in classification and estimation methods are creating new opportunities for obtaining more accurate maps of forest biophysical attributes. This research co...
Exploring the Utility of the Random Forest Method for Forecasting Ozone Pollution in Sydney
This paper explores the utility of an ensemble decision-tree method called random forest, in comparison with the classic classification and regression trees (CART) algorithm, for forecasting ground-level ozone pollution in the Sydney metropolitan region. Statistical forecasting models are developed to provide daily ozone forecasts in November-March for three subregions, i.e., Sydney east, Sydne...
Assessing Behavioral Patterns of Motorcyclists Based on Traffic Control Device at City Intersections by Classification Tree Algorithm
According to forensic statistics, motorcyclists have accounted for 26 percent of those killed in traffic accidents in Iran in recent years. Given this high number of casualties, it is necessary to investigate the causes of motorcycle accidents. Motorcyclists' dangerous behaviors, which are among these causes, are examined in this study. Traffic signs have the important role of...
Predicting The Type of Malaria Using Classification and Regression Decision Trees
Maryam Ashoori (1,*), Fatemeh Hamzavi (2). (1) School of Technical and Engineering, Higher Educational Complex of Saravan, Saravan, Iran; (2) School of Agriculture, Higher Educational Complex of Saravan, Saravan, Iran. Abstract. Background: Malaria is an infectious disease infecting 200-300 million people annually. Environme...
Prediction of melting points of a diverse chemical set using fuzzy regression tree
Classification and regression trees (CART) have the advantage of being able to handle large data sets and yield readily interpretable models. Despite these advantages, they are also recognized as highly unstable classifiers with respect to minor perturbations in the training data. In other words, these methods exhibit high variance. Fuzzy logic brings in an improvement in these aspects due ...
Introducing Tree Classification Model Algorithms and Their Application in Determining Factors Affecting Esophageal Cancer in Golestan Province
Background & objective: A common purpose of medical research is to determine the factors that affect the occurrence of an event. Because risk factors interact, regression models, discriminant analysis, and classification procedures are used. These models rely on assumptions that medical data usually do not satisfy. Therefore, alternative methods must be u...